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Abstract — Intra-session network coding has been shown to 
offer significant gains in terms of achievable throughput and 
delay in settings where one source multicasts data to several 
clients. In this paper, we consider a more general scenario where 
multiple sources transmit data to sets of clients and study the 
benefits of inter-session network coding, when network nodes 
have the opportunity to combine packets from different sources. 
In particular, we propose a novel framework for optimal rate 
allocation in inter-session network coding systems. We formulate 
the problem as the minimization of the average decoding delay in 
the client population and solve it with a gradient-based stochastic 
algorithm. Our optimized inter-session network coding solution 
is evaluated in different network topologies and compared with 
basic intra-session network coding solutions. Our results show the 
benefits of proper coding decisions and effective rate allocation 
for lowering the decoding delay when the network is used by 
concurrent multicast sessions. 

Index Terms — Network coding, decoding delay, rate allocation, 
inter-session network coding, overlay networks. 

I. Introduction 

The recent advances in adhoc and overlay networks have 
largely contributed to the development of network coding 
algorithms Q, O. These networks are characterized by a 
variety of resources, and data is delivered from multiple 
sources to sets of clients through several overlapping paths. 
This creates network coding opportunities with possibly large 
gains in terms of throughput, delay or error robustness. How- 
ever, the assumption posed by many network coding systems 
is that the network is utilized by a single source O, lU. 
This is quite restricting as multiple sessions coexist over the 
shared network resources in most realistic scenarios. Time 
sharing or optimal bandwidth allocation of the overlapping 
paths can be considered in order to make effective use of 
shared resources. Alternatively, data from different sources can 
be combined in network nodes with so-called inter-session 
network coding algorithms |5|, |6|. These algorithms have 
gained quite a lot of interest recently as they theoretically 
permit to obtain throughput and delay gains compared to 
multiplexing solutions. This is the type of systems that we 
consider in this paper. 
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Fig. 1: Illustration of multi-source data delivery on an overlay 
network. 



More precisely, we consider lossy networks with significant 
path diversity where multiple sources attempt to simultane- 
ously deliver data to different clients, as depicted in Fig[T] 
Inter-session network coding is implemented in network nodes 
in order to exploit the path diversity and reduce the decoding 
delay without the need for explicit scheduling. Specifically, 
we employ inter-session network coding based on randomized 
linear network coding |7|. In more details, the sources encode 
the data with randomized linear coding and send the coded 
data to the downstream nodes. The packets arriving at a node 
are first stored and when a transmission opportunity occurs a 
network coded packet is sent to a child node. 

The application of inter-session network coding is not, 
however, a trivial task as random mixing of packets from 
different data flows may result in unacceptably large decoding 
delays and waste of network resources. We therefore formulate 
an optimization problem to select the appropriate coding oper- 
ations in network nodes, such that the average decoding delay 
is minimized among clients. Our target is to determine the 
optimal coding strategy in the intermediate network nodes and 
the rate allocation among the intra- and inter-session network 
coded flows. Experimental evaluation shows the validity of our 
model. The application of inter-session network coding leads 
to lower decoding delay in several network topologies, or to 
the same performance as intra-session network coding in the 
least favorable configurations. To the best of our knowledge, 
our scheme is the only scheme in the literature that provides 
a complete analytical inter-session network coding framework 
that is easily extensible to any arbitrary number of concurrent 
sources. It further shows that an effective combination of 
intra- and inter-session network coding permits to reach small 
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decoding delays even in challenging network settings. 

The design of efficient inter-session network coding 
schemes |8|, |9| is largely an open problem in the literature. 
Combinations of different flows may decrease the network 
goodput and increase the decoding delays since a larger 
number of packets has to be collected before the decoding of 
one of the flows becomes feasible. Most of the schemes in the 
literature apply inter-session network coding fS], fTOl, [Til for 
wireless scenarios and exploit transmission interferences and 
overhearing of packets intended to other clients [12J. Inter- 
session network coding in binary field is examined in |8|. This 
scheme incorporates COPE [12J, an opportunistic network 
coding scheme for wireless multi-hop networks, and applies 
coding only between two wireless sessions. A distributed 
algorithm is provided for joint coding, scheduling and rate 
control. Since the perfect scheduling is not an easy task, 
the performance loss due to imperfect scheduling is also 
examined. COPE is also employed by |13| for star shaped 
networks where a compromise between the transmission rate 
and the overhear rate is found for increasing network coding 
chances. This adaptive scheduling scheme has high complexity 
and achieves only marginal throughput gains. The work in 
pOl proposes to control the overlap of transmission paths to 
take benefit of inter-session network coding. The authors in 
ifTTIl decouple the streaming over wireless networks into two 
independent problems, namely rate-control and scheduling. 
This introduces some delays in order to increase the coding 
opportunities for wireless streaming scenarios. A distributed 
algorithm for the multi-commodity transmission in network 
with two sources is developed in |9|. Inter-session network 
coding is used along with backpressure routing and the re- 
sulting scheme has complexity comparable to that of parallel 
multi-commodity flow problem (without network coding). 

The extension of inter-session network coding to multi- 
hop wired scenarios is challenging and until now only few 
works have addressed it |6|, [14J, 1 15|. Specifically, a heuristic 
algorithm has been proposed in |j6l for data multicasting. The 
sessions are divided into groups according to a parameter 
that measures the overlap among the different sessions and 
inter-session network coding is implemented only inside these 
groups. The results show increased throughput and reduced 
bandwidth consumption; however, the delays and packet losses 
are not addressed. The work in [14J follows a pollution-free 
approach, where inter-session network coding is restricted 
only to the sources that the clients want to receive. Each 
network link is split into conceptual links that carry all possible 
combinations of flows and the clients connect to the links 
containing the sources they are interested in. When each client 
is interested in a single source, then the system performs intra- 
session network coding only. Decentralized pairwise network 
coding is proposed in ifTSl for optimized distributed rate 
control. Fairness is taken into account for achieving socially 
optimal behavior and noticing throughput gains. Although this 
approach is interesting, it is not obvious to understand if the 
results can be generalized to larger topologies and how the 
system's performance scales with the number of sources. 

The closest work to our study is the scheme considered in 
|[T6l . where the parallel application of intra- and inter-session 



network coding for introducing redundancy at intermediate 
nodes is considered. The benefits of joint consideration of 
intra- and inter-session network coding become clear in this 
work, and confirm our findings. However, the work in lT6ll 
studies a wireless scenario and different challenges become 
apparent like opportunistic listening, transmission interference 
etc. Gains in terms of throughput and resilience to network 
losses are presented in simple cross topologies and for bitwise 
XOR coding {i.e., COPE-based solutions). Our work addresses 
the delay minimization by effective coding solutions in generic 
wired multi-hop networks. 

The rest of the paper is organized as follows. In Section HIl 
we provide a brief description of the intra- and inter-session 
network coding techniques, and give an example that illustrates 
the benefits of applying inter-session network coding in multi- 
source network scenarios. We then describe in Section [Till 
the architecture of our system, which employs inter-session 
network coding for data delivery. In Section |lVl we formulate 
our novel optimization problem for optimal rate allocation in 
order to achieve minimal decoding delay. Finally, in Section IVl 
we evaluate the performance of the proposed scheme in 
different settings and Section [Vll concludes the paper. 

II. INTER-SESSION NETWORK CODING 

Network coding has been proposed as an alternative to 
the traditional routing and scheduling algorithms deployed for 
network data delivery. The main idea of network coding is 
to allow intermediate network nodes to process packets and 
transmit combinations of incoming packet^. These simple 
operations performed in the network nodes result in an in- 
creased throughput, which potentially implies a decrease in the 
delay required for data delivery. Network coding also enhances 
robustness to packet losses. 

The gain obtained by applying an inter-session network 
coding scheme can be illustrated by the classical example 
depicted in Fig. [2l In this example, we consider a butterfly 
network with two simultaneous unicast sessions. The network 
is composed of two sources and ^2, and two clients Ci 
and C2. The client Ci is interested in receiving the packets 
from the source ^2 and the client C2 is interested in receiving 
the packets from source ^i. The intermediate nodes Ii and 
I2 can either act as relay nodes by simply forwarding the 
incoming packets according to a certain scheduling policy or 
perform inter-session network coding on the input messages. 
All links have a capacity of one packet per time slot. In this 
topology, each session has only one path from the source to 
the client and paths are overlapping in the segment between 
the nodes Ii and I2. Hence, when the node Ii simply 
stores and forwards the incoming packets (Fig. [2a|), the link 
(h^h) becomes a bottleneck since it can only support the 
transmission of one packet per time slot. Depending on the 
scheduling policy that node Ii adopts, (i.e., which packet will 
be forwarded first), one of the clients experiences an extra 
delay with respect to the other client. Moreover, in this case, 
the additional bandwidth provided by the links and 

^In this paper, we use the term "packet" or "symbol" interchangeably to 
denote the elementary entity in coding operations. 
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Fig. 2: Illustration of (a) routing and (b) inter-session network coding in a butterfly network with two unicast sessions. 



(52, C2) remains unexploited, since the information delivered 
on these links is useless for the clients. It is obvious that, in 
this simple topology, routing in the intermediate nodes leads 
to a suboptimal solution and a rather conservative utilization 
of the network resources. 

On the contrary, if the intermediate node Ii implements 
inter-session network coding and combines the incoming pack- 
ets from the two sessions as shown in Fig. [20 the performance 
of the network can be improved. In particular, if the node Ii 
implements inter-session network coding, the network coded 
packet, that contains information of both sources reaches both 
clients at the same time. Both clients are able to extract the 
information that they are interested in by solving a simple 
system of equations which is formed of the network coded 
packet and the packet that the clients have received directly 
from the sources. Even though the clients are not interested in 
the information contained in the uncoded packet, this packet 
is still useful for decoding the requested data. The network 
throughput is improved with inter-session network coding and 
both clients experience the same delay. In addition, all network 
resources are fully exploited in this case. 

We provide now more details about the linear network 
coding operations that are considered in this paper. In this case, 
network coded packet is represented as a linear combination 
of the original source packets. In intra-session network coding, 
any network coded packet can be thus expressed in the form 



(1) 



k=i 



where is the /c* source packet, is the corresponding 
coding coefficient that has been randomly selected from the 
finite Galois field GF(^) of size q and Ng denotes the number 
of source packets. The bold font is henceforth used to represent 
vectors. Thus, a = [ai, . . . , aArJ and x = [xi, . . . , xatJ 



are the vectors of coding coefficients and source packets, 
respectively. 

In inter-session network coding, the operations performed 
in the network nodes consist in linear combinations of pack- 
ets from different sources. In particular, if we denote as 
Xs = [xs,i, • • • , ^s,ArJ and as = [as,i, • • • , <^s,ArJ the vectors 
of source packets and the corresponding coding coefficients of 
source s, an inter-session network coded packet can be written 
in the form 



^s,k 



(2) 



s=l 



s=l k=l 



where Xg^k represents the /c* source packet of source s and 
as^k is the corresponding coding coefficient, which is drawn 
from a finite Galois field G¥(q) according to a uniform 
distribution. S is the set of sources in the network, S = \S\ is 
the total number of sources and Ns is the number of original 
source packets of the source. Depending on the sessions 
that are encoded together in each network coded packet, some 
of the vectors as can be all zero vectors. It is worth noting 
that if as = 0, V 5 G {1, 2, (i.e., only packets 
from source j are combined to generate the packet y), Eq. (O 
becomes equivalent to Eq. ([T]) and reduces to a case of intra- 
session network coding. In order to make the decoding process 
feasible, the network coded packets are augmented with a 
header of length ^^^^ log(^) bits, which contains the 
network coding coefficients. It becomes clear that the selection 
of the size of GF(q) results from a trade-off between the 
size of the overhead and the probability of generating linearly 
dependent packets, which evolves as 0{l/q) 0. 

At the clients, the decoding of a particular source is accom- 
plished by means of Gaussian elimination on a set of packets 
when a sufficient number of intra- or inter-session network 
coded packets is collected. Specifically, upon receiving a 
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network coded packet, the client stores the body message 
of the packet in the vector y and the packet header in the 
matrix A. Thus, each row of the matrix A contains the coding 
coefficients of the corresponding encoded packet stored in 
y and encompasses all the transformations sustained by the 
packets as they travel through the network. The decoding of 
source s is possible from any subset of rows of matrix A that 
(i) contains at least Ng linearly independent rows with non- 
zero coefficient vectors at the position corresponding to source 
s and (ii) is full rank, i.e., the rank of the matrix formed by this 
subset of rows is equal to the number of variables with non- 
zero coding coefficients. Thus, if we denote as A' the matrix 
formed by the rows that satisfy the above two conditions and 
as y' the corresponding vector of encoded packets, then the 
packets of the source can be recovered by solving the 
following system of equations 

y' = A'x^, (3) 

where x = . . . , xi,Ar, , . . . , . . . , x^^at^ , . . .] is the 

vector of the original source packets from all S sources. It is 
worth mentioning that x cannot be decoded fully in Eq. ©, 
and only the symbols of the source are primarily recovered. 

To summarize, the decoding of source s can be performed 
from intra-session network coded packets, inter-session net- 
work coded packets or from a combination of packets from 
both categories. The last two cases lead to decoding additional 
packets along with the packets of source s. We assume 
that these unnecessary packets are simply dropped by the 
client, which is only interested by data from one source. 
By construction of the network coded packets, the coding 
coefficients' vectors contain many zeros. In an attempt to 
reduce the decoding complexity, the rows and the columns of 
the matrix A' can be reorganized such that the resulting matrix 
is in the row echelon form. The vectors y^ and x also have 
then to be reordered correspondingly, so that Eq. © yields a 
valid expression. 

Even if inter-session network coding appears as a natu- 
ral extension of intra-session network coding, the design of 
effective coding solutions is not trivial. While intra-session 
network coding has been shown sufficient to achieve the 
maximum rate in the case of a single multicast f2l, fTTl, the 
optimal inter-session network coding solution is still an open 
probelm. The coexistence of several sources in the network 
poses challenges on the construction of the network codes. 
Random mixing of all the input packets in the intermediate 
network nodes obviously results in suboptimal performance 
since it would then be necessary to send information about all 
sources, blindly to all clients. However inter-session network 
coding can still be more efficient than solutions based on pure 
intra-session network coding 1 18|. Clearly, there exists a trade- 
off between the increase in the information content per packet 
(i.e., the number of sessions that are mixed together), and the 
decoding delay experienced by the clients. In addition, some 
of the clients might even not have sufficient bandwidth to 
collect the necessary number of inter-session coded packets 
for decoding the source that they have subscribed to. In the 
following sections, we focus on the problem of identifying the 
cases where inter-session network coding is beneficial and we 



design coding strategies that are able to exploit efficiently the 
available network resources. 

III. Coding system description 

We now describe in details the system that we consider 
in this paper. The network is modeled as a directed acyclic 
graph Q = (V,f), where V is the set of network nodes and 
£ the set of network links. The links are characterized by a 
capacity Cij , which is expressed in packets per second, and an 
average packet loss rate nij, where (i, j) G £ denotes the link 
between the nodes i and j, with z, j G V. We also assume that 
the propagation delay on each link is negligible. 

We assume that our network consists of a set of S sources 
S = {s}, s = 1, 2, 5, a set of M network clients C = {c}, 
c = 1,2,...,M and a set of intermediate network nodes X. 
We thus have V = S U C U I. An illustration of the 
system that we consider is depicted in Fig. [T] The sources 
simultaneously transmit uncorrected information to the net- 
work clients. Each client is interested in receiving only one 
of the sessions. Prior to transmission by the sources, random 
linear combinations of the original packets that belong to the 
same sources are performed. This permits to generate packets 
of equal importance at each source and thus to alleviate the 
need for precise packet scheduling mechanisms. Moreover, it 
enhances the network resilience to packet losses, since any 
subset of Ns linearly independent coded packets is sufficient 
for decoding. Finally, it provides high packet diversity to the 
system, which enables an easier and better exploitation of the 
network resources. 

Intermediate network nodes perform the task of storing, 
combining and forwarding packets to the next hop nodes in 
a push-based policy. In particular, intermediate nodes store 
the incoming packets, randomly combine them and forward 
the resulting packets on the output links, whenever there is 
a transmission opportunity. However upon receiving a packet, 
the nodes first examine whether this packet is innovative with 
respect to the packets stored in the nodes' buffers. A packet is 
considered innovative when it increases the rank of the system 
formed by the set of packets that exist in the node's buffer. 
In other words, a packet is classified as innovative when it 
cannot be generated by simply recombining the packets that 
are already stored in the node. Packets that are non-innovative 
are dropped immediately as they do not provide any novel 
information. The innovative packets are processed and stored 
in the nodes' buffers which are considered to be large enough 
to accommodate all the received packets. 

The percentage of the intra- and inter-session network coded 
packets that are forwarded by the nodes is determined by 
the coding policy. The design of effective coding policies is 
exactly the topic of this paper. In that perspective, we further 
need to define packet types and notation. Let us define as 
T = {t}, t = 1, 2, 2*^ — 1, the set of possible packet types 
that can be generated in the network. Every packet type t 
represents a particular combination of sources including intra- 
session network coded packets. With every packet type t e T 
we associate a subset of packet types Tt and a subset of 
sources St ^ S. The elements of % are all the packet types 
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TABLE I: Notations. 



Cij 


capacity of the link (i, j) in packets/sec 


TTij 


packet loss rate on the link 


Ns 


number of packets in source s 


T 


set of all possible packet types that can be generated in the 
network 


Tt 


set of all packet types that can be used to form packets of 


St 


set of all sources that are combined to generate packets of 
type t 


Tt,s 


set of all packet types that can be used to form packets of 
type t and contain source s 




set of parent nodes of node i 


Vi 


set of children nodes of node i 


/*• 

Jtj 


flow rate of packets of type t on link (i, j) 





innovative input flow rate of packets of type t on link (i, j) 




probability of forwarding a packet of type t on link (i, j) 




probability of decoding source s with exactly k packets at 
client c 


Pi 


probability of receiving a useful packet of type t at client c 


dc 


average delay for receiving one packet at client c 


Dl 


expected delay observed at client c for decoding source s 


D 


average expected delay 



that can be combined in order to form packets of type t. The 
subset St contains all the sources whose packets are combined 
in a network coded packet of type t. In addition, we define 
the subset Tt^s ^ Tt, which includes all the packet types in 
Tt that contain data from source s. According to our model, 
every network node forwards to its neighbor nodes all or a 
subset of all the possible packet types, depending on the type 
of the incoming packets and the coding strategies in the nodes. 
Hence, the capacity Cij of the link is partitioned into 

several packet flows, each corresponding to a certain packet 
type t, and every packet flow is allocated a rate of //^ packets 
per second. Whenever a transmission opportunity occurs on 
the link (i, j), the node selects randomly a packet type t by 
sampling a probability distribution Wij = {wlj}, t e T, which 
determines the relative number of packets of each type to be 
transmitted. In particular, w^j = f-j/cij is the probability of 
sending a packet of type t on the link (i, j). Then, the node 
randomly combines the available packets to generate a packet 
of type t that is sent on the output link. The network clients 
decode the network coded packets as described in Section [III 
and extract the information that is relevant to the session that 
they have subscribed to. The delay experienced by the clients 
for collecting a sufficient number of packets for decoding is 
driven by the number of innovative packets that they receive, 
which is a function of the flow rates and the network topology. 
Table |I] summarizes the notation used throughout this paper. 

IV. Decoding delay minimization 

As we have seen above, the coding strategy implemented 
in the network nodes is key to the effective performance 
of inter-session network coding systems. We propose in this 
section a solution for minimizing the average decoding delay 
among the client population. This can typically be achieved by 
limiting the number of packets that each client has to decode. 
We first derive a model for estimating the expected delay at 
the clients required to decode the packets of the source that 



they have subscribed to. Our objective is to find the optimal 
coding policy in the network nodes defining the number of 
packets of each type that have to be generated and forwarded 
on the output links so as to minimize the average expected 
decoding delay observed at the client nodes. We formulate an 
optimization problem and derive the optimal coding strategy 
using stochastic approximation algorithms. 



A. Computation of the expected delay 

The time delay experienced at a client before it collects 
a decodable set of packets depends on the coding decisions 
implemented in the intermediate network nodes and the inno- 
vative rate of each packet type. The expected delay observed 
at a client node can be computed by estimating the average 
number of packets that the client receives before it is able to 
decode. To this end, let us assume that the client c is interested 
in receiving data from source s and denote as the average 
delay observed at client c for receiving a sufficient number of 
packets in order to decode this data. This delay can be written 
as 

CO 

D^,=d, kP!{k) (4) 

k=N, 

where k is the number of packets that client c receives before 
being able to decode the information that it is interested in and 
P^ik) stands for the probability of decoding this information 
after receiving exactly k packets. The constant dc represents 
the time needed for receiving one packet and we approximate 
it as 

^ - 1 

where Ac denotes the set of parent nodes of client c and c^c 
represents the capacity of the network links serving the client 
c. 

It should be pointed out here that, for the sake of con- 
sistency, /c in Eq. dU represents not only the packets, either 
innovative or not that reach client c, but also the time slots 
when no packet arrives at the client. In other words, whenever 
a packet is lost on the channel or a transmission opportunity 
is skipped by a parent node, we consider it as equivalent to 
receiving a useless packet. The minimum number of packets 
needed for decoding the source s is equal to the size Nq of 
the data. Hence, the probability of decoding with less packets 
than Ns (i.e., k < Ns) is equal to zero. The upper limit of the 
summation in Eq. ^ is the maximum number of packets that 
a client may receive before it is able to decode. Theoretically, 
this number goes to infinity. In practice, however, there always 
exists a large enough number of packets K^ax for which 
the probability of decoding with less then Kmax packets is 
arbitrarily close to 1. 

The probability Pc{k) of decoding the packets of source s 
at client c with exactly k packets is the probability of forming 
a full rank system upon receiving the /c* packet, but not earlier 
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than that. This probabiHty can be written as follows 

t ~ ' ' 

E 



feT fci fc2 

k-1 



(5) 



ter 



ter 



where kt denotes the number of useful packets of type t 
out of the total number of /c — 1 packets received by the 
client c. Since P^ik) is the probability of decoding source 
s with exactly k packets and not less than k, the range of 
values of /ci, /c2, . . . , A:|7-| can be computed so as to satisfy this 
condition. In particular, the ^t^j-h packets should contain 
a subset of packets that, when increased by the /c* packet, 
yields a full rank decodable system for the source. The 
term represents the probability that a useful packet of type 
t arrives at client c. denotes the type of the last, (i.e., /c*) 
packet and is simply the probability that this packet is of 
type t*. A packet of type t is characterized as useful when 
it is innovative with respect to the packets that client c has 
already received. Hence, this probability is written as 



(6) 



which defines the probability p^ as the fraction of the total 
input innovative rate of packets of type t, r|^, over the 
total input bandwidth. After combining Eqs dH) and ([5]), we 
obtain the following closed form expression for the expected 
decoding delay at client c that decodes source s, as 



^c = 4E^rEE-- 



t*eT ki k2 



E 



E 



(7) 



ki\k2 \ ■ ■ ■ k\j-\\ 



(Ep*c)' 



The detailed development of Eq. ^ is provided in the Ap- 
pendix. 



B. Optimization of coding decisions 

We are now able to formulate the delay minimization 
problem that seeks for the optimal coding decisions at the 
network nodes and the corresponding allocation of rate among 
the different packet types. Specifically, we want to determine 
the probability distribution Wij = {wlj} according to which 
every node pushes packets on its output links, or equivalently, 
the flow rates f-j > for all the packet types and for 
all the network links. The optimization problem consists in 
minimizing the average expected delay D and can be written 



as follows 



s.t. 



argminD = argmin 



\r.\ Z^-^c 



rlj > 0, vt G r 



ter 



E4- ^ - TTy), V(i, j) G £ 

= 0' if n ( E E ''fci) = 0'^* e ^(^'■^■) e ^ 

4^ J2 E ^fci' e r, Vs e St, V(i,i) e £ 
E E ^fe^ ^ E ^* ^T,Wse St, Vi G V\S 

(8) 

The first two constraints of the optimization problem in 
Eq. ([8]) arise from the fact that the input innovative flow for 
every packet type is non-negative and the total input innovative 
flow on a link cannot exceed the effective bandwidth of the 
link, respectively. The third constraint states that the flows that 
cannot be generated due to the unavailability of some of the 
necessary component flow should not be allocated any rate. 
The fourth constraint states that, for every packet type, all the 
components have an innovative flow on the output links that 
is upper bounded by the input innovative flow. Finally, the 
last constraint is similar to the fourth constraint except for the 
fact that the total input innovative rate is upper bounded by 
the total innovative rate provided by the sources. We illustrate 
the meaning of the two last constraints with the following 
example. 

Example. Let us consider a node in the network that receives 
innovative intra- session network coded packets from sources A 
and B at rates va andrs respectively. The node is forwarding 
intra-session network coded packets of sources A and B and 
also combined packets of the two sources. This scenario is 
illustrated in Fig. \3\ If r'j^, r'^ and r'j^^ are respectively the 
innovative rates of the three types of flows delivered to the 
next hop nodes, then the third constraint in the optimization 
problem in Eq. ^ states that 

t'a < rA 

v'b < TB 

^'ab < min(rA -r'A^rs -r'^) 

The above example illustrates that the maximum rate allocated 
to the flow of combined packets is upper-bounded by the 
minimum of the available innovative rates for each component 
flow. For example, if r a — <tb —t'^, any additional rate 
allocated to the combined flow, such that r^^ > va — t'j^ does 
not carry any novel information with respect to the source 
A. Therefore, it can rather be used for intra-session network 
coded packets from source B with the same impact on the 
decodability. 

The minimization problem defined in Eq. ([8]) is in general 
non-convex and the number of variables increases with the 
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Fig. 3: Illustration of the fourth constraint in the optimization 
problem of Eq (O. The rate allocated to the combined flow 
does not exceed the minimum of the remaining packets in each 
input flow. 



number of sources and links in the network, which renders 
the search space huge. In order to find the optimal solution, 
we use the SPSA (Simultaneous Perturbation Stochastic Ap- 
proximation) algorithm |19|, which is an efficient gradient 
based stochastic algorithm for finding a good approximation 
of the global optimum in multivariate non-convex optimization 
problems. With the solution of Eq. ([8]), the flow rate for 

each packet type can be computed as ffj = ^ Cij , 

ft ^ ^ 

and the allocation vector wa with k;- • = is given for 

^ ''J Cij 

all intermediate nodes. 

V. Performance evaluation 

We now evaluate the performance of our inter-session 
network coding system with the proposed rate allocation tech- 
nique. We compare the inter-session network coding scheme 
with a baseline intra-session network coding solution where 
network coding operations are performed across packets of 
the same session. In order to obtain the optimal rate allocation 
for the baseline scheme, we simply modify the optimization 
problem in Eq. ^ and restrict the coding operations to 
combinations of packets of the same session by setting the 
rate of the inter-session network coded flows to zero. 

To validate the correctness of our model, we provide results 
for two small size network topologies illustrated in Fig. |4l 
though our findings can be extended to any arbitrary topology. 
Both topologies that we consider consist of 5* = 3 source 
nodes, M = 3 client nodes and 6 intermediate nodes. Every 
client is interested in receiving packets of only one of the 
available sources, as shown in Fig. (H The original packets 
are encoded at the sources with randomized linear coding to 
increase the symbol diversity and then forwarded to the next 
hop nodes. The size of the GF for coding operations is set 
to q = 256. We assume that the size of the data is Ns = 10 
packets for all sources and every source node is transmitting 
with a fixed rate of 3 packets/sec over each outgoing link. The 
intermediate nodes randomly forward network coded packets 
according to the rate allocation obtained from the solution of 
the optimization problem in Eq. ([5]). Finally, the clients decode 
the source of their interest upon collecting a decodable set of 
packets. 

In Fig. [5al we present the average expected delay expe- 
rienced by the network clients as a function of the links' 
bandwidth for the topology depicted in Fig. |4al The capacity 



of the links 1-10 varies in the interval [1,5] packets/sec and 
the packet loss rate is set to 5%. The proposed inter-session 
network coding scheme outperforms the baseline network 
coding scheme in the whole range of capacity values. Higher 
gain in terms of delay is observed in the presence of heavy 
bottlenecks, i.e., when the ratio of the input to the output 
bandwidth in the first hop helper nodes is high. The gain 
drops gradually as this ratio decreases. The performance of the 
inter-session scheme approaches the one of the intra-session 
network coding scheme, as the bandwidth becomes sufficient 
to transmit complete intra-session network coded sources. 

For the same network settings. Fig. [6al illustrates the optimal 
rate allocation of innovative packet flows obtained for the 
proposed inter-session network coding scheme for three link 
capacity values, namely 1, 3 and 5 packets/sec. We can 
observe that the optimal network coding strategy is to forward 
combined packets of sources Si and S3 on link 2 and to utilize 
some bandwidth on links 1 and 3 to transmit intra-session 
network coded packets that are used to decode the inter-session 
network coded packets. Thus, the system transmits on links 4 
and 10, some packets that are not explicitly useful for clients 
Ci and C3, but that are exploited to decode the inter-session 
network coded packets. This leads to reduced decoding delay 
for clients Ci and C3 at the expense of slightly increasing the 
decoding delay for client C2. Indeed, part of the bandwidth 
that is utilized to provide packets of source S2 to client C2 in 
the intra-session network coding scheme becomes dedicated 
to flows of packets that are useful to decode inter-session 
network coded packets. As the ratio of the input bandwidth 
over the output bandwidth decreases, the proportion of the 
inter-session network coded packets also drops and the optimal 
coding strategy converges to an intra-session network coding 
solution. 

The decoding delay for each client is finally depicted in 
Fig. [Til for the same network settings. We can see that for 
the whole range of capacity values, the clients Ci and C3 
have lower delay with inter-session network coding, whereas 
client C2 experiences a slightly higher delay. This yields an 
overall gain in the average decoding delay over the network. 
This is due to the fact that some of the network clients have 
very scarce resources and are significantly affected by the 
network bottlenecks when only intra-session network coding is 
implemented. For example, (see Fig.|4a|), the client C2 has two 
paths that are fully dedicated to its source of interest. However, 
the clients Ci and C3 each have only one path that share a 
common segment. Hence, the performance of the intra-session 
coding solution is limited by the bottleneck that is created 
by the overlapping paths. Moreover, the intra-session network 
coding strategy fails to fully utilise the network resources as 
some of the links cannot forward useful packets and thus 
some capacity remains unexploited. However, the overlap of 
the two paths creates opportunities for inter-session network 
coding. When packet combinations of different sources are 
allowed, the network links can be utilized to supply useful 
packets to the other clients in the network. This permits to 
better exploit the available capacity and to redistribute the 
existing network resources more fairly among the clients. 
Hence, by slightly penalizing client C2, the system manages 
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Fig. 4: Examples of two network topologies with three concurrent unicast sessions. The paths connecting the clients with the 
source of their interest are highlighted with different colors. The notation Cj{Si) implies that the client Cj requests data from 
the source So. 
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Fig. 5: Comparison of the minimum average delay achieved with the optimal rate allocation for the proposed inter-session 
network coding scheme, and the baseline intra-session network coding scheme for the topologies illustrated in Fig. (H 



to achieve a better average performance. As discussed earlier, 
the advantage of inter-session network coding decreases as the 
bandwidth increases to a value that is sufficient for transmitting 
each source at their source rate. 

We perform the same experiments on a different topology 
illustrated in Fig. [40 As previously, the capacity of the links 
varies in the range [1, 5] packets/sec and the packet loss rate is 
set to 5%. From Fig. [50 we observe that the performance of the 
proposed inter-session network coding scheme coincides with 
the performance of the baseline intra-session network coding 
scheme. This is due to the fact that the performance in this 



case is mostly driven by the available bandwidth rather than 
by the coding scheme. In particular, every client has two paths 
to the source and both paths overlap with a path of the other 
sources. Due to this symmetry in the network topology, the 
minimum delay can be achieved by allocating equal amounts 
of bandwidth to each of the sources and cannot be further 
reduced by inter-session network coding. In other words, the 
network does not create any opportunities for packet mixing 
across different sources. In fact, as depicted in Fig. [6bl the 
optimal rate allocation in this case contains only intra-session 
network coded flows. Interestingly, our scheme is however 
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Fig. 6: Optimal rate allocation of the innovative packet flows obtained with the proposed inter-session network coding scheme 
for the topologies illustrated in Fig. IH 
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Fig. 7: Comparison of the decoding delay experienced by the network clients for the proposed inter-session network coding 
scheme and the baseline intra-session network coding scheme for the topologies illustrated in Fig. (H 



generic and includes the pure intra-session network coding 
scheme as a potential solution. The above results are further 
supported by the decoding delay experienced by each client, 
as presented in Fig. (Tb] We can see that both schemes provide 
the same delay for every client, and that it cannot be improved 
by combining packets from different sources. 

In a third set of experiments, we consider the topology 
depicted in Fig. [8] that consists of 5 = 3 sources, M = 5 
clients and 6 helper nodes. The capacity of the sources' output 
links and the clients' input links is set to 30 packets/sec. The 
links that are represented with dashed lines have a capacity 
fixed at 10 packets/sec. The capacity of the rest of the links 
varies in the interval [5,30] packets/sec. We observe that the 
links in dashed lines cannot provide packets that are directly 
useful for clients if only routing or intra-session network 
coding are implemented, since they do not lie on a path 
connecting the clients with the sources that they request. 
However, these links are helpful for delivering network coded 



packets that facilitate decoding at the clients when inter- 
session network coding is implemented in the intermediate 
nodes. 

Fig. |9a| illustrates the improvement in the average expected 
delay experienced by the network clients when the dashed line 
links are added to the network. It confirms that the addition 
of these links creates opportunities for inter-session network 
coding and eventually reduces the decoding delay. On the other 
hand, it is clear that the addition of these links cannot enhance 
the performance of the system when inter-session network 
coding is not enabled. When only intra-session network coding 
is implemented, the average expected delay in the presence of 
dashed line links is identical to the one achieved without these 
links (the green line in Fig. [9a| coincides with the black line). 
This confirms the fact that the dashed line links cannot provide 
packets that are explicitly useful for clients. Finally, we can 
observe that without the dashed line links, the proposed inter- 
session network coding scheme performs close to intra-session 
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Fig. 8: Irregular multicast topology with three sources and 
five clients. The dashed line links have a fixed capacity of 10 
packets/sec. The notation Cj{Si) implies that the client Cj 
requests data from the source Si. 



network coding solution in the topology of Fig. [U 

The above conclusions are supported by the rate allocation 
that achieves the minimal average decoding delay. Fig. |9b] 
shows the optimal allocation of the input innovative rate 
for clients Cs and C5 for increasing values of capacities. 
The schemes under comparison are the proposed inter-session 
network coding scheme and the intra-session network coding 
scheme. We can notice that for the pure intra-session network 
coding scheme, the introduction of the dashed line links does 
not change the optimal rate allocation solution. For the inter- 
session network coding scheme, when there are no dashed line 
links, the optimal solution is very close to the intra-session 
network coding solution. We can thus conclude that in this 
particular topology there do not exist many opportunities for 
inter-session network coding in the absence of dashed line 
links. However, we can see that the addition of these links 
leads to a different rate allocation solution where significant 
amounts of the available bandwidth are allocated to inter- 
session network coded flows. In particular, we can observe 
that clients C3 and C3 benefit from the combination of 
sources S2 and 53; they also receive intra-session network 
coded packets from sources S2 and 53 respectively, so that 
the decoding is facilitated. Moreover, we can see that as the 
capacity of the bottleneck links increases, the rate of combined 
flows diminishes. This is in accordance with our intuition 
and the above findings. When the bandwidth is sufficient, all 
the packets can be provided as intra-session network coded 
packets. 

VI. Conclusions 

We have proposed a delay minimal solution for rate al- 
location in multihop networks. Our scheme minimizes the 
average expected delay experienced by the network clients 
by allowing intermediate network nodes to transmit packets 
that are combinations of different sources. Specifically, we 
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Fig. 9: (a) Average expected delay experienced by the network 
clients and (b) optimal allocation of the input innovative rate 
for clients Cs and C5, for the topology depicted in Fig. [8l 
when intra- or inter-session network coding is deployed. The 
cases A and B correspond respectively to the addition, or not, 
of the dashed line links. 



determine the optimal allocation of the intra- and inter-session 
network coded flows in the network nodes. We show that 
inter-session network coding achieves a better exploitation of 
the available network resources and offers significant gains in 
terms of decoding delay in networks with heavy bottlenecks. 
In less favorable topologies, inter-session network coding 
performs close to pure intra-session network coding as the 
latter is essentially a special case of inter-session network 
coding in our proposed system. The results of our simulations 
indicate that inter-session network coding can potentially in- 
troduce gains in terms of decoding delay to networks with 
heterogeneous clients, i.e., clients with different access to 
the network resources. However, the full characterization of 
networks where inter-session network coding is superior to 
intra-session network coding still remains an open question. 
Moreover, though the proposed method is generic and can 
be applied to an arbitrary number of sources, the increased 
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complexity of the solution prevents its utilization in real time 
applications. Our future work will focus on the design of 
distributed low-complexity inter-session network coding algo- 
rithms that would allow to take the optimal coding decisions 
in the network nodes in real time. 



with parameters r = ^ /ct + 1 and p = J] equal to 



(12) 



Appendix 

Let us consider a discrete random variable k with a probabil- 
ity mass function (pmf) given by P^{k) as defined in Eq. 
The mean value of k is then equal to 
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(9) 
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In more details, the mean value of k is given by 

OO CO 
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Now, any set of values {/ci, A:2, . . . , /c|7-| } that leads to success- 
ful decoding with exactly k packets, the last packet being of 
type t*, also leads to successful decoding with A:+l,/c+2,A:+3 
or more packets and the last packet of type . Thus, we can 
exchange the order of summations and rewrite Eq. (fTOl) as 
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(11) 



The summation term with respect to k in Eq. (fTTt represents 
the mean value of the negative binomial distribution NB(r,p) 



Combining Eqs (fTTI) and ([T2l) . we obtain the result in Eq. Q. 
Finally, Eq. (|7]) can be obtained by substituting the term 

OO 

kP^{k) in Eq. © with Eq. ®. 
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